Biological Data Handling Methods
نویسندگان
چکیده
Biological data has more variation in type and format compared to other types of data. Thus, it poses new challenges. However, it encapsulates critical information; thus, handling it is of primary interest. Data handling includes storage and retrieval of data with associated formats and methods of data transfer, data format conversion, algorithms that run on the data and the output methods including visualization of the results. High throughput methods have been yielding biological data at a fast pace. This data includes protein-protein interactions, gene sequences, gene co-expressions, and protein sequences. This data is supplemented with huge amounts of clinical data conveniently captured in electronic medical records and the wet lab data. We describe the current approaches, each with a model system and identify its key contributions. We propose some ideas for biological data handling in the future.
منابع مشابه
BioNetLink - An Architecture for Working with Network Data
The visualization of biological data gained increasing importance in the last years. There is a large number of methods and software tools available that visualize biological data including the combination of measured experimental data and biological networks. With growing size of networks their handling and exploration becomes a challenging task for the user. In addition, scientists also have ...
متن کاملDesign of Hybrid Differential Evolution and Group Method of Data Handling for Inductive Modeling
The group method of data handling (GMDH) and differential evolution (DE) population-based algorithm are two well-known nonlinear methods of mathematical modeling. In this paper, both methods are explained and a new design methodology which is a hybrid of GMDH and DE is proposed. The proposed method constructs a GMDH network model of a population of promising DE solutions. The new hybrid impleme...
متن کاملComputing Platforms for Big Biological Data Analytics: Perspectives and Challenges
The last decade has witnessed an explosion in the amount of available biological sequence data, due to the rapid progress of high-throughput sequencing projects. However, the biological data amount is becoming so great that traditional data analysis platforms and methods can no longer meet the need to rapidly perform data analysis tasks in life sciences. As a result, both biologists and compute...
متن کاملSubstructural QSAR approaches and topological pharmacophores.
For large and diverse data sets, simple QSAR methods based on linear and additive models can no longer be applied. In such cases topological methods using descriptors directly derivable from two-dimensional chemical structures provide a useful alternative. The results of such analyses can be used for lead optimization, to guide biological testing and even aid in the design of novel compounds. V...
متن کاملModeling Correlation in Incomplete Longitudinal Data: The Case of Fruit Fly Mortality Data
Longitudinal studies are prevalent in clinical trials, biological and social sciences where subjects are measured repeatedly over time. Modeling the correlations of repeated measurements on the same subject and handling missing data are challenging problems in the statistical analysis of such data. The situation is exacerbated knowing that the presence of missing data can hamper modeling of dep...
متن کامل